Mental Visual Browsing
نویسندگان
چکیده
We present a surprisingly easy-to-use video browser for helping users to pinpoint a specific video shot in mind, within a long video. At each interactive iteration, the only user effort required is to click 1 shot, which most visually relates to the user’s mental target, out of 8 displayed shots. Then, the system updates the browsing model and display another 8 shots for the next iteration. The proposed system is underpinned by a theoretically-sound Bayesian framework that maintains the probabilities of all the video shots segmented from the long video. This framework guarantees that we can find the target shot out of around 1-h video within 3–5 iterations. We believe that our system will perform well in the Video Broswer Showdown game of MMM 2016.
منابع مشابه
MusicLand: Exploratory Browsing in Music Space
Most existing search tools based on query terms focus on direct search activities, where users are assumed to have a clear and precise idea about their search targets. However, if users are uncertain about their targets, they will need to define or refine them with a succession of multiple related queries. Current search tools do not adequately support this process. MusicLand is designed for ex...
متن کاملVisual guided navigation for image retrieval
In this work, we are interested in technologies that will allow users to actively browse and navigate large image databases and to retrieve images through interactive fast browsing and navigation. The development of a browsing/navigation-based image retrieval system has at least two challenges. The first is that the system’s graphical user interface (GUI) should intuitively reflect the distribu...
متن کاملUsing Psychophysiological Sensors to Assess Mental Workload During Web Browsing
Knowledge of the mental workload induced by a Web page is essential for improving users' browsing experience. However, continuously assessing the mental workload during a browsing task is challenging. To address this issue, this paper leverages the correlation between stimuli and physiological responses, which are measured with high-frequency, non-invasive psychophysiological sensors during ver...
متن کاملTRECVID 2003 Experiments at MediaTeam Oulu and VTT
MediaTeam Oulu and VTT Technical Research Centre of Finland participated jointly in semantic feature extraction, manual search and interactive search tasks of TRECVID 2003. We participated to the semantic feature extraction by submitting results to 15 out of the 17 defined semantic categories. Our approach utilized spatio-temporal visual features based on correlations of quantized gradient edge...
متن کاملETANA-CMV: A coordinated multiple view visual browsing interface for ETANA-DL
Archeological research embracing complex Information Technology techniques can result in vast quantities of heterogeneous information from different sites in different formats. ETANA-DL is an Archeological Digital Library (DL), providing services suited for the archeological domain. With a growing collection of records in the DL, it is a challenge to present them in an organized and meaningful ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016